Complexity of Constrained VC-Classes
نویسنده
چکیده
Let F be a class of n-dimensional binary vectors, i.e., functions f : X → {0, 1} where X = [n] ≡ {1, . . . , n} with a VC-dimension V C(F) = d. The classical result of Sauer says that the complexity of F is bounded as |F| ≤ d i=0 n i ≡ S(d, n). How does the complexity decrease as one further constrains the subset of allowed functions in F ? The paper defines a constraining parameter for binary functions, called the margin μh(x, y) of h at x ∈ [n], which is a form of confidence that h takes the value y at x. Let a sample ζ = {(xi, yi)} l i=1, xi ∈ [n], yi ∈ {0, 1}, and for N ≥ 0, consider a class HN (ζ) ⊆ F of functions h having μh(xi, yi) > N , 1 ≤ i ≤ l. The above question is answered by estimating the cardinality |HN (ζ)| as a function of the margin parameter N by E1 = 1+exp(−(l+2(N +1))/n)S(d, n). In the extreme case, where ζ is the maximal-size sample on which every h ∈ HN(ζ) has μζ(h) > N , the estimate is E2 = exp(− exp(−(2N + 1)))(1 + exp(−(l + 2(N + 1))/n))S(d, n)). The latter is exponentially smaller than E1 in the region of N < N ′, where N ′ is approximately (1/2) ln(n).
منابع مشابه
Sauer's Bound for a Notion of Teaching Complexity
This paper establishes an upper bound on the size of a concept class with given recursive teaching dimension (RTD, a teaching complexity parameter.) The upper bound coincides with Sauer’s well-known bound on classes with a fixed VC-dimension. Our result thus supports the recently emerging conjecture that the combinatorics of VC-dimension and those of teaching complexity are intrinsically interl...
متن کاملComplexity of VC-classes of sequences with long repetitive runs
The Vapnik-Chervonenkis (VC) dimension (also known as the trace number) and the Sauer-Shelah lemma have found applications in numerous areas including set theory, combinatorial geometry, graph theory and statistical learning theory. Estimation of the complexity of discrete structures associated with the search space of algorithms often amounts to estimating the cardinality of a simpler class wh...
متن کاملOn the complexity of constrained VC-classes
Sauer’s Lemma is extended to classes HN of binary-valued functions h on [n] = {1, . . . , n} which have a margin less than or equal to N on all x ∈ [n] with h(x) = 1, where the margin μh(x) of h at x ∈ [n] is defined as the largest non-negative integer a such that h is constant on the interval Ia(x) = [x− a, x+ a] ⊆ [n]. Estimates are obtained for the cardinality of classes of binary valued fun...
متن کاملRecursive teaching dimension, VC-dimension and sample compression
This paper is concerned with various combinatorial parameters of classes that can be learned from a small set of examples. We show that the recursive teaching dimension, recently introduced by Zilles et al. (2008), is strongly connected to known complexity notions in machine learning, e.g., the self-directed learning complexity and the VC-dimension. To the best of our knowledge these are the fi...
متن کاملRecursive Teaching Dimension, Learning Complexity, and Maximum Classes
This paper is concerned with the combinatorial structure of concept classes that can be learned from a small number of examples. We show that the recently introduced notion of recursive teaching dimension (RTD, reflecting the complexity of teaching a concept class) is a relevant parameter in this context. Comparing the RTD to self-directed learning, we establish new lower bounds on the query co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005